They will include idioms, sayings, phrasal verbs, so brush up on your idiomatic expressions, don't " throw in the towel" , and you will " sail through" the exam with " flying colours" .
This trick hinges on the observation that for every reward model there is a specific theoretical LLM that would get full marks, and every LLM likewise has a theoretical reward model that would give it flying colours.